Finding the vacant homes the city hasn't found yet.
A machine learning model that scores all 520,000 Philadelphia parcels for vacancy risk — surfacing properties likely to be vacant that don't appear in current city records.
Vacant properties are one of the most visible signs of disinvestment in a neighborhood. They attract illegal dumping, reduce property values for surrounding owners, create fire hazards, and signal to residents that a block is being left behind.
The city's official vacancy count, the Vacant Property Indicator, is compiled from Licenses and Inspections records, and it has a known gap. A building can sit empty for years before an inspector flags it or a neighbor files a complaint. The data reflects enforcement history, not ground truth.
That gap matters, because L&I can't inspect what it doesn't know about. Community development organizations, housing courts, and city planners deciding where to direct resources end up working from an incomplete picture.
This model was built to close part of that gap. It combines dozens of signals from public administrative data: code violation history, clean and seal actions, unsafe and imminently dangerous orders, business license records, building permits, parcel characteristics from OPA, and deed transfer history. The result is a probability score for every residential parcel in the city. A higher score means a property looks more like other properties that turned out to be vacant.
The score is not a final determination of vacancy, not a lien or seizure trigger, and not a substitute for field judgment. The goal is to give the people doing the work a calibrated starting point: a prioritized list of addresses worth a second look, based on data rather than chance or proximity to the last complaint.
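For readers who want a concrete picture of what a "prioritized list" could look like, here is a minimal sketch in Python. It is an illustration and not the project's actual code: the `ParcelScore` fields, the tier cutoffs, and the function names are all assumptions made for this example. It takes parcels that already have a probability score, drops those already marked vacant in city records, and returns the highest-scoring remainder with a risk tier attached.

```python
from dataclasses import dataclass

# Hypothetical tier cutoffs, chosen only for illustration.
# The thresholds used by the real dashboard are not stated on this page.
TIER_CUTOFFS = [(0.75, "high"), (0.40, "medium"), (0.0, "low")]


@dataclass
class ParcelScore:
    parcel_id: str          # hypothetical identifier for a parcel
    score: float            # predicted probability of vacancy, between 0 and 1
    in_city_records: bool   # True if the city already lists the parcel as vacant


def risk_tier(score: float) -> str:
    """Map a probability score to a coarse tier label."""
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"score must be between 0 and 1, got {score}")
    for cutoff, label in TIER_CUTOFFS:
        if score >= cutoff:
            return label
    raise AssertionError("unreachable: the last cutoff is 0.0")


def review_list(parcels: list[ParcelScore], limit: int) -> list[tuple[str, float, str]]:
    """Return up to `limit` parcels worth a second look, highest score first.

    Parcels the city already knows about are excluded, since the point is
    to surface likely vacancies that are missing from current records.
    """
    candidates = [p for p in parcels if not p.in_city_records]
    candidates.sort(key=lambda p: p.score, reverse=True)
    return [(p.parcel_id, p.score, risk_tier(p.score)) for p in candidates[:limit]]
```

A list like this is only a starting point for field review. Each address still needs a person to confirm what is actually on the ground.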
The model produces two finished artifacts. Use the interactive dashboard to explore parcel-level scores across the city. Read the methodology report to understand how the model was built and validated, and what its limitations are.
An interactive map and table showing predicted vacancy risk scores for every residential parcel in Philadelphia. Filter by neighborhood, risk tier, or parcel type.
A full account of how the model was built: data sources, feature engineering, training approach, validation strategy, equity audit, and known limitations.
Philly Stat 360 is the City of Philadelphia's performance management initiative. We track how city government is doing — across every department, in plain language — and publish the results for every resident to see.
This vacancy risk model is part of a broader effort to use data to make city services more proactive — finding problems before they become crises, and doing it fairly.